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Diabetes mellitus is a chronic, life-threatening, and complicated condition. 
Around 1.5 million deaths due to diabetes have been documented, according 
to a World Health Organization (WHO) estimation in 2019. In the world of 
medicine, predicting diabetes risk is a difficult and time-consuming task. 
Many past studies have been conducted to investigate and clarify diabetes 
symptoms and variables. To solve these persisting issues, however, more 
critical clinical criteria must be considered. A comparative analysis based on 
three soft computing strategies for diabetes prediction has been carried out 
and achieved in this work. Among the computational intelligence methods 
used in this study are fuzzy analytical hierarchy processes (FAHP), support 
vector machine (SVM), and artificial neural networks (ANNs). The 
techniques reveal promising performance in predicting diabetes reliably and 
effectively in terms of several classification evaluation metrics, according to 
experimental analysis and assessment conducted on 520 participants using a 
publicly available dataset. 
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1. INTRODUCTION 

One of the most prevalent endocrine diseases is diabetes mellitus. That requires ongoing medical 
care with several strategies to reduce the external risk of glycemic control. An imbalance occurs in the 
person’s nutritional metabolism, leading to many complications and long-term effects, including the heart, 
kidneys, eyes, nerves, and blood vessels. To diagnose symptomatic diabetes by doctors, the patient shows 
many signs and symptoms resulting from the osmotic separation causing high blood sugar [1]. The diseases 
of diabetes complications can be divided into two combinations according to their damage, including 
macrovascular damage (the arteries) and microvascular damage (small blood vessels). Accelerated 
cardiovascular disease, which presents as strokes and other catastrophic illnesses, is the most devastating 
macrovascular consequence. Microvascular illnesses such as retinopathy in the eye, nephropathy in the 
kidney, and neuropathy in the nervous system are examples of organ-specific disorders [2], [3]. 

According to the World Health Organization (WHO), diabetes affects 422 million people worldwide 
in 2018 (WHO). Type 1 and type 2 diabetes are the two forms of diabetes. Type 1 diabetes mellitus is an 
autoimmune disease that can strike anyone at any age, but it strikes children and adolescents more frequently. 
The immune system incorrectly destroys pancreatic beta cells, resulting in total insulin insufficiency, a small 
amount of insulin released into the body, or even no insulin released into the body. Mellitus is a Type 2 
diabetes complication that arises when the body loses b-cell insulin secretion over time, generates insufficient 
insulin, or stays insulin-resistant. Gestational diabetes, on the other hand, is caused by hormonal changes that 
only occur during pregnancy. Type 1 diabetes, Type 2 diabetes, gestational diabetes, and other types of 
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diabetes caused by various factors are classified by some researchers and the American diabetes association 
(ADA) [4]. Gestational diabetes mellitus (GDM) is a kind of diabetes that only affects pregnant women and 
usually develops between the 24th and 28th week of pregnancy (except for women who already have chronic 
diabetes). When blood glucose levels rise over normal during pregnancy, it is detected. After the baby is 
born, the majority of mothers will not get diabetes. After childbirth, however, some women will continue to 
have high blood glucose levels [5]. 

According to diabetes Australia, diabetes can be present for up to seven years before clinical 
diagnosis. During this time, a person may acquire potentially fatal conditions such as blindness from eye 
damage, foot ulcers that may need amputation of the affected limbs, renal failure, and heart attacks [6]-[10]. 
Fiarni and others coined the term "silent killer" to describe it for these reasons [3]. With regular 
examinations, early detection, and treatment initiation, these repercussions can be prevented, managed, or 
even eliminated in some persons, saving roughly 1415 US dollars [5]. 

Artificial intelligence technologies are routinely employed to detect and diagnose diseases 
automatically. The authors in [11] recommended the use of a subset evaluator (CSE) as a method for 
identifying the most important risk variables for diabetes prevalence in the body. Based on the Pima Indian 
Diabetes dataset, the authors combined CSE and decision tree (DT) to create a classifier subset evaluator 
decision tree (CSE-DT) (PIDD). Moreover, Shuja et al. [12] developed a two-stage approach for diabetic 
prediction based on data mining categorization techniques: For data preprocessing, the initial stage is 
synthetic minority oversampling technique (SMOTE). Five machine learning classifiers are used in the 
second stage: simple logistic, decision tree, bagging, artificial neural networks (ANN), and support vector 
machine (SVM). Swapna et al. [13] a method for distinguishing normal heart rate variability (HRV) signals 
and diabetes was proposed utilizing deep learning architectures. To construct an integrated system, the 
authors merged a convolutional neural network (CNN) with long short-term memory (LSTM). Multiple 
criteria decision making (MCDM), a sub-field and key branch of operations research (OR) have been widely 
employed in many scientific domains for problem-solving and decision-making in addition to artificial 
intelligence algorithms [14], [15]. The MCDA or MCDM problem normally comprises four phases: 
formulation of options, criterion selection, criterion weighting, and decision making [16]. By merging 
MCDM approaches with artificial intelligence techniques, particularly soft computing technologies, hybrid 
methodologies can be constructed. The fuzzy analytic hierarchy process (FAHP) is a multi-criterion decision- 
making technique analytic hierarchy process (AHP) that incorporates fuzzy theories (a branch of soft 
computing) [17]. 

In the literature, there have been many methods developed for diabetes prediction. logistic 
regression is one of the classification techniques used to predict diabetes [18]. The authors employed seven 
factors as attributes in their data analysis, resulting in a prediction probability of 78.5565. Another work, 
Morgan et al. [19] used the world health survey plus (WHS+), which was performed with WHO support 
across five Gulf cooperation council (GCC) nations in 2008 and 2009, including the UAE, Kuwait, Saudi 
Arabia, and Oman. The sample sizes for the authors were UAE (n=2569), Kuwait (n=3828), Saudi Arabia 
(n=8629), and Oman (n=4717). According to the findings, Oman has the lowest standardized prevalence of 
diabetes at 8.5% (7.4%-9.8%), followed by Saudi Arabia at 10.5% (9.6%-11.4%), the United Arab Emirates 
at 13.2% (11.4%-15.2%), and Kuwait at 15.3% (13.9%-16.8%). Singh et al. [20] The Pima Indians diabetes 
dataset was also used to develop machine learning approaches for diabetes diagnosis, including likelihood- 
based navie bayes (NB), decision tree-based random forest (RF), and multi-layered function-based random 
forest (RF) (MLP). They demonstrated that data pre-processing can improve the performance of machine 
learning algorithms. 

Vidhya and Shanmugalakshmi [21] considered many risk factors to predict diabetes mellitus 
including, the patients’ body mass index level, bad eating habits, poor exercise, smoking, nature of work, and 
other factors regardless of the age or gender of the patient. They found the existing techniques, ANN and 
SVM, achieved an accuracy of 57.41% and 62.81%. In comparison, deep belief network (DBN) achieved an 
accuracy of 80.99%, achieving better results than the other machine learning methods. Ding et al. [22] 
proposed a novel approach for predicting diabetic complications based on the similarity enhanced latent 
Dirichlet assignment (seLDA) model. After preprocessing the data, they computed the similarity between 
each pair based on medical records, then used the similarity estimates as constraints in seLDA-based diabetes 
complications mining. The proposed approach (SVM-seLDA) consistently beat the traditional and seLDA- 
based approaches by 22.49% in estimating similarity and predicting diabetic complications, according to the 
experimental data. Liu et al. [23] the disease type 2 diabetes mellitus (T2DM) was used as a case study, with 
the focus on issues that occur after the first diagnosis. Modeling some risk complications and analyzing the 
linkages between risk factor selection patterns are the foundations of their strategy. They concluded that the 
Bayesian hierarchical framework outperformed the most recent models. 
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FAHP has been used in many applications and different fields, such as weapon selection [24], 
personnel selection [25], job selection [26], energy alternatives selection [27], and performance evaluation 
systems [28], [29]. Chamodrakas et al. [30] used a fuzzy AHP method for supplier selection in electronic 
marketplaces. Similarly, Kilincci et al. [31] used the fuzzy AHP approach in the washing machine company 
for selection purposes. Shaw et al. [32] proposed combining fuzzy AHP and fuzzy objective linear 
programming to better select a supplier for developing a low-carbon supply chain. Furthermore, Arikan [33] 
to handle multiple objective supplier selection challenges, we created an interactive solution using Fuzzy 
AHP. Their proposed strategy had three objectives: reduce total financial costs, improve overall quality, and 
improve customer service. 

Data mining techniques were used in the most recently established diabetes prediction algorithms 
[34]. Islam et al. [34] a dataset of 520 occurrences was examined using three machine learning techniques: 
the Naive Bayes method, the logistic regression algorithm, and the random forest algorithm. Random forest 
algorithms produced the most accurate results, according to their research. To predict diabetes 
Garcfa-Ordas et al. [35] used state-of-the-art deep learning approaches. On the Pima Indians diabetes dataset, 
they used a variational autoencoder (VAE) to increase data, a sparse autoencoder (SAE) to increase features, 
and a CNN for classification, using a variational autoencoder (VAE) to increase data, a sparse autoencoder 
(SAE) to increase features, and a CNN to increase classification. 

Despite the researchers’ best efforts, more research and comparison of diabetes prediction 
approaches are still needed and open for investigation. For early diabetes prediction, this paper presents a 
comparative study involving three different soft computing techniques and a multi-criteria decision analysis 
method. For diabetes prediction and classification, the researchers used the fuzzy analytical hierarchy process 
(FAHP), artificial neural network (ANN), and support vector machine (SVM) methodologies. Selecting these 
three soft computing approaches in our study is based on the diversity of concepts and strength/weaknesses 
of those methods. This is the first study conducted to compare the performance of a multi-criteria decision- 
making method for diabetes prediction to the other computational intelligence methods. On publicly available 
datasets gathered from 520 patient and control subjects, the methods were tested. 

The remaining sections of this work are presented as in: the approaches were provided and 
explained in section 2. The experimental data and comments are presented in section 3. Finally, section 4 
brings the process to a close. 


2. RESEARCH METHOD 
To conduct our comparative study, three methods have been considered. The description of these 
methods is presented briefly in this section as in: 


2.1. Fuzzy analytical hierarchy process (FAHP) 

Saaty [17] a multi-criteria decision-making technique called the FAHP was devised. To build 

FAHP, the fuzzy theory is incorporated into the fundamental AHP. The AHP approach, for example, has 

been used in the past to forecast sickness [36]. As a result, FAHP is used to predict the occurrence of diabetes 

in this study. FAHP is a decision-making procedure that is frequently employed in cases involving 
conflicting criteria. The pairwise comparisons of the criteria and alternatives are implemented in FAHP by 
using triangular numbers to represent the variables [37]. The steps of the FAHP method can be described as: 

- Step 1: By using the language concepts in Table 1, the decision-maker compares the criteria and options. 
As a result, the comparison matrix can be created. In terms of the fuzzy triangular scales that these 
linguistic concepts correspond to, for example, if the decision-maker declares that Criterion 1 (C1) is less 
important than Criterion 2 (C2), it uses the fuzzy triangle scale as: (2)-(4). In the pairwise contribution 
matrix of the criteria, the comparison of C2 to C1 will use the fuzzy triangle scale (1/4, 1/3, 1/2). The 
pairwise contribution matrix is shown in (1), where signifies the zth decision maker's preference for the 
xth criterion over the yth criterion, using fuzzy triangular scales. “Tilde” indicates to the triangular scale 
description, for the instance, the first decision maker's preference of criterion! over criterion 2, equals to 
mi, = (2,3,4). M matrix of pairwise comparison is formed (1), where mz, denotes that the zth makes 
the selection of the xth dimension over the yth dimension. 


eZ Z WA 
miı miz we Mn 
im Ze WA 
ms m aes we mMm 
MZ = 21 2n (1) 
me, MZ, My 


- Step 2: Priority of each defendant (mz, )is collected i, using (2): 
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Table 1. Linguistic terms and the corresponding fuzzy triangular scales 


Integrate Scale Linguistic Description Fuzzy Triangular Scale 
1/9 Extremely less important (1/9, 1/9, 1/9) 
1/8 The intermediate values between two adjacent scales (1/9, 1/8, 1/7) 
1/7 Very strongly less important (1/8, 1/7, 1/6) 
1/6 The intermediate values between two adjacent scales (1/7, 1/6, 1/5) 
1/5 strongly less important (1/6, 1/5, 1/4) 
1/4 The intermediate values between two adjacent scales (1/5, 1/4, 1/3) 
1/3 Moderately less important (1/4, 1/3, 1/2) 
1/2 The intermediate values between two adjacent scales (1/3, 1/2, 1/1) 

1 Equal Important (1,1,1) 
2 The intermediate values between two adjacent scales (1, 2, 3) 
3 Moderately more important (2, 3, 4) 
4 The intermediate values between two adjacent scales (3, 4, 5) 
5 Strongly more important (4, 5, 6) 
6 The intermediate values between two adjacent scales (5, 6, 7) 
7 Very strongly more important (6, 7, 8) 
8 The intermediate values between two adjacent scales (7, 8, 9) 
9 Extremely more important (9, 9, 9) 


- Step 3: The matrix of pairwise comparison is updated based on the average of responses. 
Š Mi o Min 
M=]| : E : (3) 


Mnı °° Mnn 


- Step 4: Calculating the geometric mean of a fuzzy valuation matrix for each dimension. 


i a 1/n 
Yy = Ma fňxy) x= 1,2 EENS n (4) 


- Step 5: Merged 3 steps together and calculating the fuzzy weights of each criterion as shown in: 
- Step 5.1: Calculate the vector summation of 7,. 


Fe = Lira he (5) 


- Step 5.2: Calculate the power of negative one of summation vector. 
Bek DS tn 
(9) = ON? (6) 


- Step 5.3: Calculated the fuzzy weights of each criterion by multiply with reverse each 7, of 
summation vector: 


Ww, =F Q (i OF @......%,) 7 = wy, mwy, uwy) (7) 


- Step 6: Using the area center method, the non-fuzzy value is calculated by using (8). 


_ Wxtmwytuwy 


A, =e (8) 


- Step 7: Non -Fuzzy values is normalized, by using (9): 


Ax 


Ne = FRA G 


2.2. Artificial neural networks (ANNs) 
A multi-layer perceptron architecture called an ANN [38] is used to train and classify input patterns 
to produce the required output. We used a three-layer network in our ANN model, with 14 neurons in the 
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input layer, 16 neurons in the hidden layer, and two neurons in the output layer. The number of neurons in 
hidden layer was chosen empirically, providing the best performance. Backpropagation was used as an 
optimization tool to adjust the network's weights with slope decline after our ANN model was trained on the 
data. To improve the performance of the ANN model, we use the cross-entropy loss function to alter the 
weights by minimizing the error at each stage. The soft-max activation function is employed in the output 
layer to generate the final prediction. The fourteen criteria serve as the network's input. At the same time, the 
output layer distinguishes between diabetes presence (1) and diabetes absence (0). 

The equation of the cross-entropy loss function can be defined as (10): 


L = — Yiu, t; Log(pi) (10) 


where t refers to the ground truth value of sample i and p represents the probability of the sample resulted 
from soft-max activation function. 


2.3. Support vector machines (SVMs) 

SVMs are a type of machine learning and artificial intelligence method that is often employed in 
supervised learning. One of the most powerful prediction algorithms is SVMs. During the training process, 
SVMs develop a model for the dataset to predict point labels. SVMs learn a linear decision boundary to 
distinguish between the two classes based on a set of binaries labeled training vectors. The model is 
evaluated using the derived linear classification rule to categorize additional test instances [39]. A solid 
margin classifier, the simplest sort of SVM, is used to discover the linear classifier rule with the greatest 
geometric margin. Many optimization problems can be solved with linear SVM. The SVM static margin 
develops the hard hyperplane in the linearly separable state to obtain all data accurately sorted and increase 
the distance to the nearest training data points. In real-world data, datasets are frequently not linearly 
separable, necessitating the adjustment of the SVM. By using the soft margin principle, this modification is 
required to achieve a trade-off between maximizing geometric margin and decreasing classification error on 
the points of training data. The soft margin creates a hyperplane, allowing incorrect classification of difficult 
cases to increase the distance between them and the next entirely separated data samples [39], [40]. Suppose 
the training set is given as (x1, l4), (x2, l2), --, Xn, ln), where x is the feature set and l; are the labels with R 
classes, l; E€ {1,2, ..., R}. The primal problem for SVM is given as (11): 


mMiny bp 5 umeall Mall? + C Efa DR ey, PI 
s.t.: Wy Xe + by — (Way + bm) 2 1 — PP" 
P” > 0,i=1,..,n;m E {1,..,R} (11) 


where P; represents the distance of the point that is classified in the wrong class from the margin, 
W represents the weights, b is bias, and C is a constant coefficient that its value reflects the weight of the 
penalty. 


3. EXPERMENAL RESULTS AND AYSISNTAL 

To carry out the experiments, we used a dataset collected by Islam et al. [34]. The number of 
subjects in the provided dataset is 520 persons with fourteen attributes represent symptoms that may cause 
diabetes. In addition, two more attributes (age and gender) representing socio-demographic are included in 
the dataset. This study only focuses on symptom attributes of diabetes disease, including genital thrush, 
alopecia, weakness, obesity, muscle stiffness, delayed healing, polydipsia, polyuria, polyphagia, and visual 
blurring, irritability, sudden weight loss, partial paresis, and itching. This dataset has been collected by a 
conducting survey using questionnaires targeting people who have recently got diabetic or are still 
nondiabetic but have some symptoms. There are 404 female and 116 male persons, their age between (16- 
90). The allocation of symptoms among persons is illustrated in Figure 1. 

The pairwise comparison (PWC) matrix of the FAHP should be prepared initially to establish the 
FAHP diagnosis technique depicted in Figure 2. The importance of symptoms was rated using the doctor's 
judgment, as indicated in Table 2. The symptoms are abbreviated as: GT: genital thrush, AC: Alopecia, WN: 
weakness, OS: obesity, MS: muscle stiffness, DH: delayed healing, PD: Polydipsia, PR: Polyuria, PG: 
Polyphagia, VB: visual blurring, IA: irritability, SWL: sudden weight loss, PP: partial paresis, and LI: 
itching. The consistent comparison matrix, which is valid for experiments, was established after multiple 
revisions in the pairwise comparison matrix with the doctor's assistance. The achieved consistency ratio of 
the pairwise comparison matrix is 0.09, which is less than 0.1. 
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Figure 1. Symptoms distribution among 520 persons. Yes: indicates the symptoms presence and 
No: indicates the symptom absence 


Figure 2. Conceptual level of FAHP for diabetes prediction 


The value of the geometric mean of the fuzzy matrix is obtained as indicated in Table 3 after 
generating pairwise a comparison matrix. For example, according to (4), the product of the fourteen vectors 
yields the value of the geometric mean of fuzzy (rx) for the first criterion. As shown in: 


A 1 
f = (Miti Mey) =[(1*2*114x3x4*0.250+0.2*1x1x1x1%*1%*1)1; 
1 
(1*3*1%*5*4*5 *0.333*0.250*1*1*1*1*1+*1)1%; 
1 
(1*4*1*6*5*6*0.500 * 0.333 *1*1*1¥*1*1* 1)14] = [1.408; 1.258; 1.119] 
thus, the total values are found by the sum of the fourteen criteria of rą. The reverse values of total sum 
P (-1), shown in Table 3, are found by (total sum) ^-1, (14.227) *-1= 0.070. In addition, Increasing Order of 


P (-1) is obtained by exchange for the first column with for the third column as shown in the last row (INCR) 
of Table 3. 
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Tabel 2. Pairwise comparison matrix (PWC) matrix 


CRI GT AC WN OS MS DH PD PR PG VB IA SWE PP Lf 

GT (1) 234) (LLI 456 645) 456 Go. Gop GLD GL) LD (L) (LD (LD 
AC G52) GLD G52) 023) 234) 234) G22) 659 GLD GLD GL) (LD (LD LD 
WN (11 234) (0,1,1) (4,5,6) 678) 789 (LLD GL) GL) (LLD GL) GLb) (1,1,1) (1,1,1) 
SE Ee a eG ee a Ce et) 
ms Gibb GED CiD cao ALD Ge Gen CID bh bb ann dbo Gao Geb 
DH Ge) EED Goo G45) (23) GID Go2) Goo GL GL) GL) Ga. Ge Gy 
PD 23,4) 45,6) (1,1,1) 678) 456) 789 (LD G29 (LD GL GL) GL) GLD (1,1,1) 
PR (3,45) (9,9,9) (1,1,1) 789) (56.7) 667) (123) (LLD (LLD G1) GL) GLb (LLD (1,1,1) 
PG (LLD (LLD (L11 699 234 (LD (LLD (LLD OLD (LLD 234 629 G59 CLD 
VB (LLD GL) (LLD 6.67) (1,23) (LD (LD (LÐ GL) GL) (LD GL) GL) (1,1,1) 
IA (LD (LD (LD 234 (LD (LD (LD (LD Gp) LD GLD (LÐ Gi) (1,1,1) 
SWL (LD) (1,1,1) (1,1,1) 645) 234) 667) (LD (LLD (1,23) (,,D GL) (1,1,1) G45) (1,1,1) 
PP (LD (LD (LD 0.23) (1,23) 645) (LLD (LD (23) (LD (LD G25 LD LD 
Ud) (LLD GL) 089) 645) 234) 014D GL) G1) GL) GL) (LÐ Gi) (1,1,1) 


To calculate the fuzzy weights of each criterion (Wx), in (7) is then applied as follows: W1= 
[1.119*0.055; 1.258*0.063; 1.408* 0.070] = [0.062; 0.079; 0.099]. In the last step, the non-fuzzy weight 
value of each criterion (Ax) is found by taking the average of fuzzy weight values for each criterion using (8) 
as follows: A= [(0.062+0.079+0.099)/3] =0.080. Thus, the calculation of the total of Ax is obtained by 
summing of Ax values for each criterion. Finally, the normalization of non —Fuzzy values (Nx) is calculated 
by using (9). For example, Nı=A;/ total of N1= 0.080/1.008 =0.079. The weight calculation for all criteria is 
presented in Table 3. 


Table 3. The geometric mean of fuzzy (rx), with fuzzy weight (wx), with averaged weight criterion (Ax), and 
normalized weight criterion (Ni) 


CRI Tx W, Ax Nx 
Cl 1.408 1.258 1.119 0.062 0.079 0.099 0.080 0.079 
C2 0.925 0.801 0.681 0.037 0.050 0.065 0.051 0.050 
C3 1.703 1.618 1.515 0.083 0.102 0.119 0.101 0.101 
C4 0.322 0.245 0.000 0.000 0.015 0.023 0.013 0.013 
CS 0.578 0.000 0.324 0.018 0.000 0.040 0.019 0.019 
C6 0.627 0.461 0.385 0.021 0.029 0.044 0.031 0.031 
C7 1.936 1.727 1.547 0.085 0.109 0.135 0.110 0.109 
C8 2.193 2.034 1.830 0.101 0.128 0.154 0.127 0.126 
c9 1.426 1.240 1.104 0.061 0.078 0.100 0.080 0.079 

C10 1.243 1.194 1.122 0.062 0.075 0.087 0.075 0.074 

C11 1.051 1.000 0.952 0.052 0.063 0.074 0.063 0.062 

C12 1.727 1.575 1.379 0.076 0.099 0.121 0.099 0.098 

C13 1.312 1.160 0.964 0.053 0.073 0.092 0.073 0.072 

C14 1.449 1.385 1.306 0.072 0.087 0.101 0.087 0.086 

Total 14.227 15.698 17.900 1.008 1.000 


P (-1) 0.070 0.064 0.056 
INCR 0.056 0.064 0.070 


The highest weights were assigned among the symptoms to Polyuria (PR) followed by Polydipsia 
symptom (PD). The lowest weight is assigned obesity symptom (OS) followed by delayed healing 
symptom (DH). After calculating the weight of the criteria, the data entry of each criterion in the dataset is 
then multiplied by the assigned weight. The summation value of each subject is then calculated by summing 
the weighted criteria. Hence, each subject will have a certain value resulted from the total sum of the 
weighted criteria values. The mean of these weighted criteria is then computed and taken as the threshold 
value. The threshold is then compared to each value in the weighted sum connected to a specific criterion. If 
the weighted sum value is equal to or more than the threshold, the result is (1), indicating that the patient has 
diabetes. If the weighted sum value is less than the threshold, the result is (0), indicating a negative diabetes 
diagnosis. The value of the threshold obtained from our experiment was 2.43. 

To achieve the purpose of our comparison study, we also trained and tested ANN and SVM models. 
To process label imbalance, the dataset's minority class (negative classes) was oversampled substantially 
during training. The oversampling doubles the size of the negative examples and subsequently, balances the 
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two classes, producing better data representation during the training phase. The data was split into three 
categories: 60% for training, 15% for validation, and 25% for testing. By comparing the label of anticipated 
diabetes with the actual value provided along with the dataset, the performance for diabetes prediction is 
evaluated using various evaluation criteria. accuracy, sensitivity, specificity, precision, F-measure, and G- 
mean are among the evaluation metrics, which are defined as: 


TP+TN 
ACCURACY (AC) = SEP i 
TP 
SENSITIVITY(SEN) = a 5 
TN 
SPECIFICITY (SP) = aan a 
TP 
PRECISION(PR) = T T 
F — MEASURE = 2x sensitivity X precision m 


sensitivity + precision 


G — MEAN = , Sensitivity x Specificity (17) 


where TP, FP, TN, and FN represent true positive, false positive, true negative, and false negative, 
respectively. The results obtained from the three diabetes prediction models including, FAHP, ANN, and 
SVM, using the six-evaluation metrics have been depicted in Table 4. 


Table 4. The comparison among the three diabetes prediction models 


Model Accuracy Sensitivity Specificity Precision F-Measure G-Mean 
FAHP 0.7654 0.7312 0.82 0.8667 0.7932 0.7744 

ANN 0.8385 0.747 1 1 0.8552 0.8643 

SVM 0.8923 0.8793 0.9028 0.8793 0.8793 0.891 


The findings show that the FAHP model is an excellent tool for diagnosing medical disorders based 
on many criteria, where the relative importance (priority) of each criterion to the others is not well defined. 
The reported sensitivity shows 0.7312, 0.747, and 0.8793 from FAHP, ANN, and SVM, respectively. These 
values indicate that the methods could be used in clinical practice as a computer-assistant diagnosis tool and 
as a second observer. Yet, it will not replace the decision taken by an expert physician. It's worth noting that 
the assessment metrics for FAHP are slightly lower than those published for ANN and SVM since it was 
tested on the entire dataset with no oversampling, unlike machine learning models. Overall, the three diabetes 
prediction models produce competitive findings and good performance, demonstrating their possibility of 
detecting diabetes early. 


4. CONCLUSION 

In this paper, a comparison study has been conducting harnessing soft computing techniques to early 
predict diabetes from publicly available data. The generated models, which included FAHP, ANN, and SVM, 
were successful in detecting diabetes in a group of people. The FAHP approach was used to create, 
implement, and evaluate the interest of weighting criteria from diabetic symptoms. Furthermore, in terms of 
accuracy, sensitivity, specificity, precision, F-measure, and F-mean, the findings obtained from applying the 
proposed approaches demonstrated that it is promising in accurately and effectively predicting diabetes. The 
proposed diabetes prediction algorithms can be readily and smoothly applied to models for other diseases. 
For future research, we suggest studying the performance of these methods based on an ensemble learning 
paradigm. 
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